Your browser doesn't support javascript.
Show: 20 | 50 | 100
Results 1 - 20 de 32
Filter
1.
IEEE Access ; : 1-1, 2023.
Article in English | Scopus | ID: covidwho-20242834

ABSTRACT

During the formation of medical images, they are easily disturbed by factors such as acquisition devices and tissue backgrounds, causing problems such as blurred image backgrounds and difficulty in differentiation. In this paper, we combine the HarDNet module and the multi-coding attention mechanism module to optimize the two stages of encoding and decoding to improve the model segmentation performance. In the encoding stage, the HarDNet module extracts medical image feature information to improve the segmentation network operation speed. In the decoding stage, the multi-coding attention module is used to extract both the position feature information and channel feature information of the image to improve the model segmentation effect. Finally, to improve the segmentation accuracy of small targets, the use of Cross Entropy and Dice combination function is proposed as the loss function of this algorithm. The algorithm has experimented on three different types of medical datasets, Kvasir-SEG, ISIC2018, and COVID-19CT. The values of JS were 0.7189, 0.7702, 0.9895, ACC were 0.8964, 0.9491, 0.9965, SENS were 0.7634, 0.8204, 0.9976, PRE were 0.9214, 0.9504, 0.9931. The experimental results showed that the model proposed in this paper achieved excellent segmentation results in all the above evaluation indexes, which can effectively assist doctors to diagnose related diseases quickly and improve the speed of diagnosis and patients’quality of life. Author

2.
Social Semiotics ; 33(2):278-285, 2023.
Article in English | ProQuest Central | ID: covidwho-20236514

ABSTRACT

In China and around the world, the global spread of COVID-19 has made wearing a facemask more than a pragmatic or aesthetic individual-level issue: it has instilled in people deontic value. In Chinese anti-epidemic narratives, the semiotic ideology of wearing a facemask has been closely related to collectivism, patriotism and, to a certain degree, nationalism. The facemask not only serves as a protective biomedical device but also as a cultural, political and spatial sign of the line of defence against disorders of the natural system, to establish the order of the social system. This paper argues from the perspective of semiotics and life politics that such mask narratives have effectively helped China prevent the large-scale spread of the epidemic across the nation and have served as a means of collective psychotherapy, paradoxically transforming individual separation into collective spiritual cohesion. Previous semiotic studies of disaster have not paid much attention to plagues or disaster governance discourse, between which biomedicine plays an important role. Thus, this paper aims to shed light on how biomedicine works with politics in coding and decoding the relationship between the natural system of the plague and the social system of governance.

3.
International Journal of Advanced Computer Science and Applications ; 14(4):456-463, 2023.
Article in English | Scopus | ID: covidwho-2321413

ABSTRACT

Online learning has gained a tremendous popularity in the last decade due to the facility to learn anytime, anything, anywhere from the ocean of web resources available. Especially the lockdown all over the world due to the Covid-19 pandemic has brought an enormous attention towards the online learning for value addition and skills development not only for the school/college students, but also to the working professionals. This massive growth in online learning has made the task of assessment very tedious and demands training, experience and resources. Automatic Question generation (AQG) techniques have been introduced to resolve this problem by deriving a question bank from the text documents. However, the performance of conventional AQG techniques is subject to the availability of large labelled training dataset. The requirement of deep linguistic knowledge for the generation of heuristic and hand-crafted rules to transform declarative sentence into interrogative sentence makes the problem further complicated. This paper presents a transfer learning-based text to text transformation model to generate the subjective and objective questions automatically from the text document. The proposed AQG model utilizes the Text-to-Text-Transfer-Transformer (T5) which reframes natural language processing tasks into a unified text-to-text-format and augments it with word sense disambiguation (WSD), ConceptNet and domain adaptation framework to improve the meaningfulness of the questions. Fast T5 library with beam-search decoding algorithm has been used here to reduce the model size and increase the speed of the model through quantization of the whole model by Open Neural Network Exchange (ONNX) framework. The keywords extraction in the proposed framework is performed using the Multipartite graphs to enhance the context awareness. The qualitative and quantitative performance of the proposed AQG model is evaluated through a comprehensive experimental analysis over the publicly available Squad dataset. © 2023, International Journal of Advanced Computer Science and Applications. All Rights Reserved.

4.
Ieee Access ; 11:595-645, 2023.
Article in English | Web of Science | ID: covidwho-2311192

ABSTRACT

Biomedical image segmentation (BIS) task is challenging due to the variations in organ types, position, shape, size, scale, orientation, and image contrast. Conventional methods lack accurate and automated designs. Artificial intelligence (AI)-based UNet has recently dominated BIS. This is the first review of its kind that microscopically addressed UNet types by complexity, stratification of UNet by its components, addressing UNet in vascular vs. non-vascular framework, the key to segmentation challenge vs. UNet-based architecture, and finally interfacing the three facets of AI, the pruning, the explainable AI (XAI), and the AI-bias. PRISMA was used to select 267 UNet-based studies. Five classes were identified and labeled as conventional UNet, superior UNet, attention-channel UNet, hybrid UNet, and ensemble UNet. We discovered 81 variations of UNet by considering six kinds of components, namely encoder, decoder, skip connection, bridge network, loss function, and their combination. Vascular vs. non-vascular UNet architecture was compared. AP(ai)Bias 2.0-UNet was identified in these UNet classes based on (i) attributes of UNet architecture and its performance, (ii) explainable AI (XAI), and, (iii) pruning (compression). Five bias methods such as (i) ranking, (ii) radial, (iii) regional area, (iv) PROBAST, and (v) ROBINS-I were applied and compared using a Venn diagram. Vascular and non-vascular UNet systems dominated with sUNet classes with attention. Most of the studies suffered from a low interest in XAI and pruning strategies. None of the UNet models qualified to be bias-free. There is a need to move from paper-to-practice paradigms for clinical evaluation and settings.

5.
International Journal of Advanced Computer Science and Applications ; 14(3):640-649, 2023.
Article in English | Scopus | ID: covidwho-2300359

ABSTRACT

In December 2019, the COVID-19 epidemic was found in Wuhan, China, and soon hundreds of millions were infected. Therefore, several efforts were made to identify commercially available drugs to repurpose them against COVID-19. Inferring potential drug indications through computational drug repositioning is an efficient method. The drug repositioning problem is a top-K recommendation function that presents the most likely drugs for specific diseases based on drug and disease-related data. The accurate prediction of drug-target interactions (DTI) is very important for drug repositioning. Deep learning (DL) models were recently exploited for promising DTI prediction performance. To build deep learning models for DTI prediction, encoder-decoder architectures can be utilized. In this paper, a deep learning-based drug repositioning approach is proposed, which is composed of two experimental phases. Firstly, training and evaluating different deep learning encoder-decoder architecture models using the benchmark DAVIS Dataset. The trained deep learning models have been evaluated using two evaluation metrics;mean square error and the concordance index. Secondly, predicting antiviral drugs for Covid-19 using the trained deep learning models created during the first phase. In this phase, these models have been experimented to predict different antiviral drug lists, which then have been compared with a recently published antiviral drug list for Covid-19 using the concordance index metric. The overall experimental results of both phases showed that the most accurate three deep learning compound-encoder/protein-encoder architectures are Morgan/AAC, CNN/AAC, and CNN/CNN with best values for the mean square error, the first phase concordance index, and the second phase concordance index. © 2023,International Journal of Advanced Computer Science and Applications. All Rights Reserved.

6.
Expert Systems with Applications ; 224, 2023.
Article in English | Scopus | ID: covidwho-2297620

ABSTRACT

This study aims to estimate the prices in the next 24 h with deep learning methods in the Turkish electricity market. The model is based on hourly data for the period 2017–2021 using electricity prices. The model's Root Mean Square Error (RMSE) value is 3.14, and the explanatory power R2 is 0.94. Since this model also considers the subgroups in the database, it can make price predictions for the pandemic period. To test the robustness and consistency of the model, twelve RNN-based models were re-estimated with the same data set. Although all models successfully predict the prices, The TEDSE Model performs better than the others. This study will be especially beneficial to electricity market players and policymakers. In further studies, the TEDSE model can be used for price prediction in intraday energy markets. This study's most important contribution is methodology innovation, using the Transformer Encoder-Decoder with Self-Attention (TEDSE) model for the first time to estimate electricity prices. © 2023 Elsevier Ltd

7.
IEEE Transactions on Instrumentation and Measurement ; : 1-1, 2023.
Article in English | Scopus | ID: covidwho-2296656

ABSTRACT

Recently, accurate segmentation of COVID-19 infection from computed tomography (CT) scans is critical for the diagnosis and treatment of COVID-19. However, infection segmentation is a challenging task due to various textures, sizes and locations of infections, low contrast, and blurred boundaries. To address these problems, we propose a novel Multi-scale Wavelet Guidance Network (MWG-Net) for COVID-19 lung infection by integrating the multi-scale information of wavelet domain into the encoder and decoder of the convolutional neural network (CNN). In particular, we propose the Wavelet Guidance Module (WGM) and Wavelet &Edge Guidance Module (WEGM). Among them, the WGM guides the encoder to extract infection details through the multi-scale spatial and frequency features in the wavelet domain, while the WEGM guides the decoder to recover infection details through the multi-scale wavelet representations and multi-scale infection edge information. Besides, a Progressive Fusion Module (PFM) is further developed to aggregate and explore multi-scale features of the encoder and decoder. Notably, we establish a COVID-19 segmentation dataset (named COVID-Seg-100) containing 5800+ annotated slices for performance evaluation. Furthermore, we conduct extensive experiments to compare our method with other state-of-the-art approaches on our COVID-19-Seg-100 and two publicly available datasets, i.e., MosMedData and COVID-SemiSeg. The results show that our MWG-Net outperforms state-of-the-art methods on different datasets and can achieve more accurate and promising COVID-19 lung infection segmentation. IEEE

8.
6th International Conference on Electronics, Communication and Aerospace Technology, ICECA 2022 ; : 1199-1206, 2022.
Article in English | Scopus | ID: covidwho-2273654

ABSTRACT

Drug Target Interaction (DTI) prediction is an important factor is drug discovery and repositioning (DDR) since it detects the response of a drug over a target protein. The Coronavirus disease 2019 (COVID-19) disease created groups of deadly pneumonia with clinical appearance mostly similar to SARS-CoV. The precise diagnosis of COVID-19 clinical outcome is more challenging, since the diseases has various forms with varying structures. So predicting the interactions between various drugs with the SARS-CoV target protein is very crucial need in these days, which may leads to discovery of new drugs for the deadly disease. Recently, Deep learning (DL) techniques have been applied by the researches for DTI prediction. Since CNN is one of the major DL models which has the ability to create predictive feature vectors or embeddings, CNN-OSBO encoder-decoder architecture for DTI prediction of Covid-19 targets has been designed Given the input drug and Covid-19 target pair of data, they are fed into the Convolution Neural Networks (CNN) with Opposition based Satin Bowerbird Optimizer (OSBO) encoder modules, separately. Here OSBO is utilized for regulating the hyper parameters (HPs) of CNN layers. Both the encoded data are then embedded to create a binding module. Finally the CNN Decoder module predicts the interaction of drugs over the Covid-19 targets by returning an affinity or interaction score. Experimental results state that DTI prediction using CNN+OSBO achieves better accuracy results when compared with the existing techniques. © 2022 IEEE.

9.
9th IEEE International Conference on Data Science and Advanced Analytics, DSAA 2022 ; 2022.
Article in English | Scopus | ID: covidwho-2271753

ABSTRACT

In this study, we present an in-depth analysis of users' propensity toward negative and hateful behavior during the COVID-19 pandemic. We analyze a large dataset extracted from Twitter from the months of January 2020 up until June 2020. The dataset includes 2,470,888 tweets from 3,269 users who are active over a period of six months. We model users' propensity toward hateful content over time by leveraging Random Forest regressor model and Long Short-Term Memory (LSTM) based many-to-one and Sequence2Sequence models for both short and long-term predictions. Our models leverage a set of features for each user, including the user's psychological traits. We also study the impact of external triggers, such as COVID-related news concurrent with the users' activities. To encode popular news, we propose using encoder states of a Sequence2Sequence model as features for a Tree-based regressor. The regressor, when combined with the vectorized news, results in an accurate prediction of tweeter's hateful behavior in the short (decoder size of four weeks) and long term (decoder size of 10 weeks) with a total training data of 15 weeks x 3269 users. We also show that our model accurately profiles selected groups of users, as they are defined by specific psychological traits. © 2022 IEEE.

10.
5th International Conference on Algorithms, Computing and Artificial Intelligence, ACAI 2022 ; 2022.
Article in English | Scopus | ID: covidwho-2265590

ABSTRACT

Our work aims to generate new ideas to explore in a specific domain using generative language models. For example, doctors can write about known symptoms as cues to the system, and then the system will generate ideas based on the cues. Similar scenarios can be thought of for other scientific domains. We used transformer-based decoders, especially GPT3-based transformer decoders, as the language models and generators. As the data, we used COVID-19 open research dataset [18]. We finetuned GPT-NEO-125M and GPT-NEO-1.3B models with 125 million and 1.3 billion parameters, respectively. The later model generated more coherent text and could link ideas relevant to the same problem better. We report here our findings with examples generated from our finetuned models. © 2022 ACM.

11.
IEEE Access ; 11:16621-16630, 2023.
Article in English | Scopus | ID: covidwho-2281059

ABSTRACT

Medical image segmentation is a crucial way to assist doctors in the accurate diagnosis of diseases. However, the accuracy of medical image segmentation needs further improvement due to the problems of many noisy medical images and the high similarity between background and target regions. The current mainstream image segmentation networks, such as TransUnet, have achieved accurate image segmentation. Still, the encoders of such segmentation networks do not consider the local connection between adjacent chunks and lack the interaction of inter-channel information during the upsampling of the decoder. To address the above problems, this paper proposed a dual-encoder image segmentation network, including HarDNet68 and Transformer branch, which can extract the local features and global feature information of the input image, allowing the segmentation network to learn more image information, thus improving the effectiveness and accuracy of medical segmentation. In this paper, to realize the fusion of image feature information of different dimensions in two stages of encoding and decoding, we propose a feature adaptation fusion module to fuse the channel information of multi-level features and realize the information interaction between channels, and then improve the segmentation network accuracy. The experimental results on CVC-ClinicDB, ETIS-Larib, and COVID-19 CT datasets show that the proposed model performs better in four evaluation metrics, Dice, Iou, Prec, and Sens, and achieves better segmentation results in both internal filling and edge prediction of medical images. Accurate medical image segmentation can assist doctors in making a critical diagnosis of cancerous regions in advance, ensure cancer patients receive timely targeted treatment, and improve their survival quality. © 2013 IEEE.

12.
IEEE Journal on Selected Areas in Communications ; 41(1):107-118, 2023.
Article in English | Scopus | ID: covidwho-2245641

ABSTRACT

Video represents the majority of internet traffic today, driving a continual race between the generation of higher quality content, transmission of larger file sizes, and the development of network infrastructure. In addition, the recent COVID-19 pandemic fueled a surge in the use of video conferencing tools. Since videos take up considerable bandwidth ( ∼ 100 Kbps to a few Mbps), improved video compression can have a substantial impact on network performance for live and pre-recorded content, providing broader access to multimedia content worldwide. We present a novel video compression pipeline, called Txt2Vid, which dramatically reduces data transmission rates by compressing webcam videos ('talking-head videos') to a text transcript. The text is transmitted and decoded into a realistic reconstruction of the original video using recent advances in deep learning based voice cloning and lip syncing models. Our generative pipeline achieves two to three orders of magnitude reduction in the bitrate as compared to the standard audio-video codecs (encoders-decoders), while maintaining equivalent Quality-of-Experience based on a subjective evaluation by users ( n=242 ) in an online study. The Txt2Vid framework opens up the potential for creating novel applications such as enabling audio-video communication during poor internet connectivity, or in remote terrains with limited bandwidth. The code for this work is available at https://github.com/tpulkit/txt2vid.git. © 1983-2012 IEEE.

13.
13th International Conference on Computing Communication and Networking Technologies, ICCCNT 2022 ; 2022.
Article in English | Scopus | ID: covidwho-2213226

ABSTRACT

Covid-19 has had an adverse effect on the world, with more than 440 million cases recorded so far. The outbreak has hampered the country's healthcare and economy. This calls for an accurate prediction model for the prediction of Covid Cases, so that it gives some time to the hospitals and administration, to make the necessary arrangement. For population-dense countries like India, the covid case dynamics of every district is different, hence this requires a district-wise case prediction of Covid Cases. In this paper, we perform prediction of covid cases across all districts of India using different architectures of Long short-term memory (LSTM) and performed a comparative analysis between them. To the best of our knowledge, this is the first such attempt at the district level. Bidirectional LSTM encoder-decoder outperformed other LSTM-based models and, gave a test set MAPE of 15.44, followed by LSTM Encoder Decoder, giving a MAPE of 19.72. © 2022 IEEE.

14.
3rd International Symposium on Artificial Intelligence for Medical Sciences, ISAIMS 2022 ; : 542-546, 2022.
Article in English | Scopus | ID: covidwho-2194149

ABSTRACT

The coronavirus disease (COVID-19) pandemic has contribute to a harsh effect on the global public health. Computed Tomography (CT) is an effective tool in the screening of COVID-19. It is greater important to rapidly and accurately segment COVID-19 from CT to help diagnostic and monitor patients. In this paper, we propose a Progressive encoder and decoder U-Net++ based segmentation network using attention mechanism. In terms of COVID-19 lesion segmentation problems with highly imbalanced dataset and small regions of interests (ROI), we will use a progressive encoder and decoder combined with dilated convolution to form a deeper network structure, which can extract more and lower level semantic features while ensuring spatial information features. We propose to incorporate an attention mechanism to a progressive encoder and decoder U-Net++ architecture to capture rich contextual relationships for better feature representations. Meanwhile, the focal tversky loss is enhanced to address the small lesion segmentation. In addition, after combining the advantages of multiple modules, the network parameters will increase abruptly. According to the performance of the model in the validation set, we cut the redundant branch of the network model to do the final segmentation test, which can not only reduce the segmentation accuracy, but also reduce the network parameters and calculation cost. The experiment results, evaluated on a small dataset where only 3520 CT images are available, prove the enhanced model can achieve an accurate result on COVID-19 segmentation. The obtained Dice Score, Sensitivity and Specificity are 70.1%, 82.1%, and 92.3%, respectively. © 2022 ACM.

15.
IEEE Journal on Selected Areas in Communications ; : 1-1, 2022.
Article in English | Scopus | ID: covidwho-2152491

ABSTRACT

Video represents the majority of internet traffic today, driving a continual race between the generation of higher quality content, transmission of larger file sizes, and the development of network infrastructure. In addition, the recent COVID-19 pandemic fueled a surge in the use of video conferencing tools. Since videos take up considerable bandwidth (~100 Kbps to a few Mbps), improved video compression can have a substantial impact on network performance for live and pre-recorded content, providing broader access to multimedia content worldwide. We present a novel video compression pipeline, called Txt2Vid, which dramatically reduces data transmission rates by compressing webcam videos (“talking-head videos”) to a text transcript. The text is transmitted and decoded into a realistic reconstruction of the original video using recent advances in deep learning based voice cloning and lip syncing models. Our generative pipeline achieves two to three orders of magnitude reduction in the bitrate as compared to the standard audio-video codecs (encoders-decoders), while maintaining equivalent Quality-of-Experience based on a subjective evaluation by users (n = 242) in an online study. The Txt2Vid framework opens up the potential for creating novel applications such as enabling audio-video communication during poor internet connectivity, or in remote terrains with limited bandwidth. The code for this work is available at https://github.com/tpulkit/txt2vid.git. IEEE

16.
Comput Biol Med ; 151(Pt A): 106306, 2022 Dec.
Article in English | MEDLINE | ID: covidwho-2104651

ABSTRACT

The outbreak of new coronary pneumonia has brought severe health risks to the world. Detection of COVID-19 based on the UNet network has attracted widespread attention in medical image segmentation. However, the traditional UNet model is challenging to capture the long-range dependence of the image due to the limitations of the convolution kernel with a fixed receptive field. The Transformer Encoder overcomes the long-range dependence problem. However, the Transformer-based segmentation approach cannot effectively capture the fine-grained details. We propose a transformer with a double decoder UNet for COVID-19 lesions segmentation to address this challenge, TDD-UNet. We introduce the multi-head self-attention of the Transformer to the UNet encoding layer to extract global context information. The dual decoder structure is used to improve the result of foreground segmentation by predicting the background and applying deep supervision. We performed quantitative analysis and comparison for our proposed method on four public datasets with different modalities, including CT and CXR, to demonstrate its effectiveness and generality in segmenting COVID-19 lesions. We also performed ablation studies on the COVID-19-CT-505 dataset to verify the effectiveness of the key components of our proposed model. The proposed TDD-UNet also achieves higher Dice and Jaccard mean scores and the lowest standard deviation compared to competitors. Our proposed method achieves better segmentation results than other state-of-the-art methods.


Subject(s)
COVID-19 , Communication Aids for Disabled , Humans , COVID-19/diagnostic imaging , Algorithms , Heart
17.
3rd International Workshop of Advances in Simplifying Medical Ultrasound, ASMUS 2022, held in Conjunction with 25th International Conference on Medical Image Computing and Computer Assisted Intervention, MICCAI 2022 ; 13565 LNCS:23-33, 2022.
Article in English | Scopus | ID: covidwho-2059734

ABSTRACT

The need for summarizing long medical scan videos for automatic triage in Emergency Departments and transmission of the summarized videos for telemedicine has gained significance during the COVID-19 pandemic. However, supervised learning schemes for summarizing videos are infeasible as manual labeling of scans for large datasets is impractical by frontline clinicians. This work presents a methodology to summarize ultrasound videos using completely unsupervised learning schemes and is validated on Lung Ultrasound videos. A Convolutional Autoencoder and a Transformer decoder is trained in an unsupervised reinforcement learning setup i.e., without supervised labels in the whole workflow. Novel precision and recall computation for ultrasound videos is also presented employing which high Precision and F1 scores of 64.36% and 35.87% with an average video compression rate of 78% is obtained when validated against clinically annotated cases. Even though demonstrated using lung ultrasound videos, our approach can be readily extended to other imaging modalities. © 2022, The Author(s), under exclusive license to Springer Nature Switzerland AG.

18.
IEEE Transactions on Instrumentation and Measurement ; : 1-1, 2022.
Article in English | Scopus | ID: covidwho-1992679

ABSTRACT

The spread of COVID-19 has brought a huge disaster to the world, and the automatic segmentation of infection regions can help doctors to make diagnosis quickly and reduce workload. However, there are several challenges for the accurate and complete segmentation, such as the scattered infection area distribution, complex background noises, and blurred segmentation boundaries. To this end, in this paper, we propose a novel network for automatic COVID-19 lung infection segmentation from CT images, named BCS-Net, which considers the boundary, context, and semantic attributes. The BCS-Net follows an encoder-decoder architecture, and more designs focus on the decoder stage that includes three progressively Boundary- Context-Semantic Reconstruction (BCSR) blocks. In each BCSR block, the attention-guided global context (AGGC) module is designed to learn the most valuable encoder features for decoder by highlighting the important spatial and boundary locations and modeling the global context dependence. Besides, a semantic guidance (SG) unit generates the semantic guidance map to refine the decoder features by aggregating multi-scale high-level features at the intermediate resolution. Extensive experiments demonstrate that our proposed framework outperforms the existing competitors both qualitatively and quantitatively. IEEE

19.
Cell Rep ; 40(9): 111300, 2022 08 30.
Article in English | MEDLINE | ID: covidwho-1982705

ABSTRACT

Synthetic mRNA technology is a promising avenue for treating and preventing disease. Key to the technology is the incorporation of modified nucleotides such as N1-methylpseudouridine (m1Ψ) to decrease immunogenicity of the RNA. However, relatively few studies have addressed the effects of modified nucleotides on the decoding process. Here, we investigate the effect of m1Ψ and the related modification pseudouridine (Ψ) on translation. In a reconstituted system, we find that m1Ψ does not significantly alter decoding accuracy. More importantly, we do not detect an increase in miscoded peptides when mRNA containing m1Ψ is translated in cell culture, compared with unmodified mRNA. We also find that m1Ψ does not stabilize mismatched RNA-duplex formation and only marginally promotes errors during reverse transcription. Overall, our results suggest that m1Ψ does not significantly impact translational fidelity, a welcome sign for future RNA therapeutics.


Subject(s)
COVID-19 Vaccines , COVID-19 , COVID-19/prevention & control , Humans , Nucleotides , Proteins , Pseudouridine/genetics , RNA , RNA, Messenger/genetics , RNA, Messenger/metabolism , Vaccines, Synthetic , mRNA Vaccines
20.
4th IEEE International Conference on Design and Test of Integrated Micro and Nano-Systems, DTS 2022 ; 2022.
Article in English | Scopus | ID: covidwho-1973452

ABSTRACT

Nowadays, streaming applications have been in great demand, especially due to covid-19 (teleworking, online teaching, virtual reality, etc.). In addition, artificial intelligence has become widely used especially in video processing domains, so a video with high quality improves the accuracy rate of this application. To meet these needs, the Versatile Video Coding standard (VVC) has appeared to give a high compression efficiency compared to high-efficiency video coding. This norm consists of a high complexity algorithm that offers an improvement in processing time and decreases the bit rate by 50 % thanks to several new compression techniques. In this context, we propose the implementation of an intra prediction decoding chain of this standard on a system on chip. In this work, we highlight the VVC feature enhancements, we present the suitable method for VVC intra-prediction decoder implementation on the PYNQ-Z2, and we provide profiling in terms of decoding time and power consumption. As a future work, this study is helpful to distinguish the block that will be a candidate for a Hardware acceleration. © 2022 IEEE.

SELECTION OF CITATIONS
SEARCH DETAIL